
WHAT IS CLAIMED IS: 

\ y/ A method for detecting an object in video data, 
the^ideo data including a background area and at least one 
object area, the video data comprising frames divided into 
blocks, the method comprising the steps of: 

extracting a moving vector from each block of a frame; 

estimating a movement of the background area in the 
video data by referring to the moving vectors; 

eliminating the estimated movement of the background 
area from the video data; and 

detecting the object area by referring to the moving 
vectors in the video data from which the movement of the 
background area is eliminated. 

2. The object detection method according to claim 1, 
further comprising the step of: 

eliminating a low reliability moving vector from the 
video data after extracting the moving vector of each 
block. 

3. The object detection method according to claim 1, 
wherein the estimating step includes the steps of: 
approximating the movement of the background area by a 

predetermined transformation model; and 

estimating a transformation coefficient of the 
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transformation model by referring to the moving vectors in 
the video data, 

4. The object detection method according to claim 3, 
wherein the transformation coefficient of the 

conversion matrix is estimated by a robust estimation 
method . 

5. The object detection method according to claim 1, 
wherein the estimating step includes the steps of: 
unifying blocks having similar moving vectors, the 

frame being thus divided into a plurality of unified areas; 

estimating the transformation coefficient of each 
unified area; 

clustering the unified areas each having similar 
transformation coefficient; and 

assigning the largest clustered area as the background 
area . 

6. The object detection method according to claim 1, 
wherein the estimating step includes the steps of: 
unifying blocks having similar moving vectors in a 

frame ; 

comparing a plurality of frames each being divided 
into a plurality of unified areas; 

estimating the transformation coefficient of each 
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unified area in the plurality of frames; 

clustering the unified areas each having similar 
transformation coefficient for each frame; 

determining the correspondence of unified areas among 
the plurality of frames; 

correcting the clustering result so that each of 
corresponding unified areas belongs to the same cluster 
among the plurality of frames; and 

assigning the largest clustered area as the background 
area in the plurality of frames. 

An apparatus for detecting an object in video 
dat^, the video data including a background area and at 
least one object area, the video data comprising frames 
divided into blocks, comprising: 

extraction means for extracting a moving vector from 
each block of a frame; 

estimation means for estimating movement of the 
background area in the video data by referring to the 
moving vectors; 

elimination means for eliminating the estimated 
movement of the background area from the video data; and 

object detection means for detecting the object area 
by referring to the moving vectors in the video data from 
which the movement of the background area is eliminated. 
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Q/C A computer readable memory containing computer 
readable instructions for detecting an object in video 
data, the video data including a background area and at 
least one object area, the video data comprising frames 
divided into blocks, comprising: 

instruction means for causing a computer to extract a 
moving vector from each block of a frame; 

instruction means for causing a computer to estimate a 
movement of the background area in the video data by 
referring to the moving vectors; 

instruction means for causing a computer to eliminate 
the estimated movement of the background area from the 
video data; and 

instruction means for causing a computer to detect the 
object area by referring to the moving vectors in the video 
data from which the movement of the background area is 
eliminated. 

A video data description method, comprising the 
stTeps of : 

extracting feature data of\a predetermined object and 
feature data of a background area from a frame of an input 
video; \^ 

describing the feature data of the predetermined 
object and the feature data of the background area as a 




descriptor of the frame; and 



\ 



\ 

\ 
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attaching the descriptor to the frame. 

10. The video d^a description method according to 
claim 9, 

wherein the describing step includes the step of 
describing a difference between the feature data of 
the predetermined object ^nd the feature data of the 

_ „ ...^ 

11. The video data description method according to 
claim 9, 

wherein the feature data of ythe predetermined object 
includes at least position, outward form, and moving 
information of the object, and 

wherein the feature data of the background area 
includes at least moving information of the background 
area. 



\ 

12. The video data description meth'od according to 



claim 9, 



wherein the descriptor includes a frame number, a 

\ 

pointer to a next descriptor, the feature da^-ta of the 

\ 

background area, and the feature data of each^^bject in the 



frame 



13. The video data description method according to 
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claim 12, 

whereinVthe descriptor is created from a corresponding 
frame at an iVterval of a predetermined number of frames in 
the video strear 

14^ A computed readable memory containing computer 
readable instructions, comprising: 

instruction means Nf or causing a computer to extract 
feature data of a predet^ermined object and feature data of 
a background area from a f^rame of an input video; 

instruction means for causing a computer to describe 
the feature data of the predetermined object and the 
feature data of the background a^ea as a descriptor of the 
frame; and 

'v 

instruction means for causing a computer to attach the 
descriptor to the frame. 



A video data retrieval method, comprising the 

stej/s of: 

describing feature data of a predetermined object and 
feature data of a background area for each frame in a 
plurality of frames; 

inputting feature data of a retrieval object; 

calculating a difference between the feature data of 
the background area and the feature data of the 
predetermined object for each frame; 
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comparing the difference with the feature data of the 
retrieval object; and 

retrieving the predetermined object or the frame 
including the predetermined object matched with the 
retrieval object from the plurality of frames according to 
the comparison result, 

16. The video data retrieval method according to 
claim 15, 

wherein the describing step includes the step of: 
describing the difference between the feature data of 

the background area and the feature data of the 

predetermined ob j ect ♦ 

17. The video data retrieval method according to 
claim 15, 

further comprising steps of: 

inputting feature data of the background area 
representing camera work; 

comparing the input feature data with the feature data 
of the background area of each frame; and 

retrieving the frame obtained by a camera work matched 
with the camera work of the input feature data from the 
plurality of frames according to the comparison result. 



18. The video data retrieval method according to 



claim 15, 

wherein the feature data of the predetermined object 
includes moving information of the predetermined object for 
each frame, and 

further comprising the steps of: 

inputting a moving information of the retrieval 
ob j ect ; 

sampling the moving information of the retrieval 
object at the frame interval; 

comparing the sampled moving information with the 
moving information of the predetermined object of each 
frame; and 

retrieving the predetermined object or the frame 
including the predetermined object matched with the 
retrieval object from the plurality of frames according to 
the comparison result. 

19^. A video data retrieval apparatus, comprising: 
yrnemory means for storing a descriptor of feature data 
of a predetermined object and feature data of a background 
area for each frame in a plurality of frames; 

input means for inputting feature data of a retrieval 
ob j ect ; 

calculation means for calculating a difference between 
the feature data of the background area and the feature 
data of the predetermined object; and 



retrieval means for retrieving the predetermined 
object or the frame including the predetermined object 
matched with the retrieval object from the plurality of 
frames by comparing the difference with the feature data of 
the retrieval object. 



instruction means for causing a computer to describe 
feature data of a predetermined object and feature data of 
a background area for each frame in a plurality of frames; 

instruction means for causing a computer to input 
feature data of a retrieval object; 



calculate a difference between the feature data of the 
background area and the feature data of the predetermined 
ob j ect ; 

instruction means for causing a computer to compare 
the difference with the feature data of the retrieval 
object; and 

instruction means for causing a computer to retrieve 
the predetermined object or the frame including the 
predetermined object matched with the retrieval object from 
the plurality of frames according to the comparison result. 



read 




instructions , comprising : 



readable memory containing computer 



instruction means for causing a computer to 




